Audiovisual Head Orientation Estimation with Particle Filtering in Multisensor Scenarios

نویسندگان

  • Cristian Canton-Ferrer
  • Carlos Segura
  • Josep R. Casas
  • Montse Pardàs
  • Javier Hernando
چکیده

This article presents a multimodal approach to head pose estimation of individuals in environments equipped with multiple cameras and microphones, such as SmartRooms or automatic video conferencing. Determining the individuals head orientation is the basis for many forms of more sophisticated interactions between humans and technical devices and can also be used for automatic sensor selection (camera, microphone) in communications or video surveillance systems. The use of particle filters as a unified framework for the estimation of the head orientation for both monomodal and multimodal cases is proposed. In video, we estimate head orientation from color information by exploiting spatial redundancy among cameras. Audio information is processed to estimate the direction of the voice produced by a speaker making use of the directivity characteristics of the head radiation pattern. Furthermore, two different particle filter multimodal information fusion schemes for combining the audio and video streams are analyzed in terms of accuracy and robustness. In the first one, fusion is performed at a decision level by combining each monomodal head pose estimation, while the second one uses a joint estimation system combining information at data level. Experimental results conducted over the CLEAR 2006 evaluation database are reported and the comparison of the proposed multimodal head pose estimation algorithms with the reference monomodal approaches proves the effectiveness of the proposed approach.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Head Orientation Estimation Using Particle Filtering in Multiview Scenarios

This paper presents a novel approach to the problem of determining head pose estimation and face 3D orientation of several people in low resolution sequences from multiple calibrated cameras. Spatial redundancy is exploited and the head in the scene is approximated by an ellipsoid. Skin patches from each detected head are located in each camera view. Data fusion is performed by back-projecting ...

متن کامل

Human-Activity Analysis in Multimedia Data

Many important applications in multimedia revolve around the detection of humans and the interpretation of their behavior. These include surveillance and intrusion detection, video conferencing applications, assisted living applications, and automatic analysis of sports videos, broadcasts, and movies, to name just a few. Success in these tasks often requires the integration of various sensor or...

متن کامل

Comparison of Sampling-Based Algorithms for Multisensor Distributed Target Tracking

Nonlinear filtering is certainly very important in estimation since most real-world problems are nonlinear. Recently a considerable progress in the nonlinear filtering theory has been made in the area of the sampling-based methods, including both random (Monte Carlo) and deterministic (quasi-Monte Carlo) sampling, and their combination. This work considers the problem of tracking a maneuvering ...

متن کامل

Model-Free Head Pose Estimation Based on Shape Factorisation and Particle Filtering

Head pose estimation is essential for several applications and is particularly required for head pose-free eye-gaze tracking where estimation of head rotation permits free head movement during tracking. While the literature is broad, the accuracy of recent vision-based head pose estimation methods is contingent upon the availability of training data or accurate initialisation and tracking of sp...

متن کامل

Head Pose Estimation System Based on Particle Filtering with Adaptive Diffusion Control

In this paper, we propose a new tracking system based on a stochastic filtering framework for reliably estimating the 3D pose of a user’s head in real-time. Our system estimates the pose of a user’s head in each image frame whose 3D model is automatically obtained at an initialization step. In particular, our estimation method is designed to control the diffusion factor of a motion model adapti...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • EURASIP J. Adv. Sig. Proc.

دوره 2008  شماره 

صفحات  -

تاریخ انتشار 2008